arXiv:1504.07328vl [math.PR] 28 Apr 2015 


ASYMPTOTIC NORMALITY OF DEGREE COUNTS IN A 
PREFERENTIAL ATTACHMENT MODEL 

SIDNEY I. RESNICK AND GENNADY SAMORODNITSKY 


Abstract. Preferential attachment is a widely adopted paradigm for under¬ 
standing the dynamics of social networks. Formal statistical inference, for 
instance GLM techniques, and model verification methods will require know¬ 
ing test statistics are asymptotically normal even though node or count based 
network data is nothing like classical data from independently replicated ex¬ 
periments. We therefore study asymptotic normality of degree counts for a 
sequence of growing simple undirected preferential attachment graphs. The 
methods of proof rely on identifying martingales and then exploiting the mar¬ 
tingale central limit theorems. 


1. Introduction 

Preferential attachment is a widely adopted model for understanding social net¬ 
work growth. The assumption posits that nodes with a large number of existing 
connections are more likely to attract connections from new nodes joining the net¬ 
work. This paradigm is one of the justifications for perceived power law behavior. 

Statistical analyses of social networks is complicated by the fact that node based 
data is nothing like classical iid data obtained from repeated sampling. Efforts 
at statistical estimation and model confirmation center around graphical based 
slope methods, regression models for the center of the data and tail estimation 
methodology. These statistical techniques are adaptations of classical methods but 
currently are largely without justification. Yet they produce reasonable answers 
and plots. 

We begin a program for justifying statistical methodology by examining the 
asymptotic normality of counting variables for the number of nodes of degree k 
in a simple undirected preferential attachment model described in [III Chapter 8]. 
We let N n (k) be the number of nodes of degree k at the nth stage of development 
of the network. It is known N n (k)/n —»■ pk and for our model, the sequence {pk} 
can be exhibited explicitly. The martingale central limit theorem allows us to 
prove asymptotic normality for ^Jn{Nk{k)/n — pk)- This emphasizes consistency 
of the empirical count percentages as estimates of {pk} and provides confidence 
statements for the estimates. 
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We describe the undirected preferential attachment model in Section 11.11 and 
give known mathematical results that we need in Section [3] Asymptotic normality 
is considered in the simplest case k = 1 in Section [3] and for the general case in 
Section [4] 


1.1. The model for preferential attachment. We consider a simple growing 
undirected graph with preferential attachment that is outlined in [111 . Chapter 
8] or Q. The random graph at stage n is G n = ( V n ,E n ), the set of nodes or 
vertices is V n := {l,...,7i} and the set of undirected edges is E n , a subset of 
{{i, j} : i,j £ V n }. For v £ V n , let D n (y) be the degree of v at stage n ; that is, the 
number of edges incident to v. As a convenient and harmless initialization, assume 
Vi consists of a single node 1 with a self-loop so that D\(l) = 2. 

Conditional on knowing the graph G n , at stage n + 1 a new node n + 1 appears 
and with a parameter 6 > — 1, either 

(1) The new node n + 1 attaches to v £ V n with probability 


( 1 . 1 ) 


D n (v) + 5 
n(2 + 5) + (1 + S) ’ 


or 

(2) n + l attaches to itself with probability 


( 1 . 2 ) 


1 + 6 

n(2 + 6) + (1 + 6) 


In the first case D n+1 (n + 1) = 1 and in the second case D n+1 (n + 1) = 2. It is 
standard for this model that 


y: D n (v )=2 n 

vev n 

and thus the attachment probabilities in m and OD add to 1. 


2. Martingale Central Limit theorem 

Martingale central limit theorems have been used for a long time. In order 
to make the paper self-contained, we present the statements we will need in this 
section. We start with a one-dimensional martingale adaptation of the Lindeberg- 
Fcllcr central limit theorem (0, Chapter 8] or jb}). 

Proposition 2.1. Let {X n ^ m , JF n ^ m , 1 ^ m ^ n} be a square integrable martingale 
difference array satisfying 

(1) Vn n := l) ~^ a as n —> OO. 

( 2 ) T,rn^n E ( X l,ra l [\X ntm \>e}\Xn,rn-l) 4 0 US 71 -> OO for all £ > 0. 

Then as n —> oo, 

n 

(2.1) 5Z Xn,m^ N(0,a 2 ). 

m= 1 

Using the Cramer-Wold device, it is not hard to extend the statement of Propo¬ 
sition [2J] to the multivariate case. 
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T 

Proposition 2.2. Let {XJFn^mi 1 ^ VYl ^ > X nrn — 1 ; • ■ * j -^n,m,d) j 

be a d-dimensional square-integrable martingale difference array. Consider the dxd 
nonnegative definite random matrices 


G 


n,m 


fax, 


n,m,2-Xn,m,j 


\j~n,m— l) 5 j — •> 


V n 


n 

^ ^ Gn,mi 
m —1 


and suppose (A n ) is a sequence of l x d matrices with a bounded supremum norm. 
Assume that 

(1) A n V n Al 4 E as n —> oo for some nonrandom (automatically nonnegatively 
definite) matrix E. 

( 2 ) J2m^n E ( X n,m,i 1 [\X n , m , i \>e]\J :r n,m-l) 4 0 OS U ->• OO far all i = 1, . . . , d 
and e > 0. 

Then in R*, 


(2.2) 'y ( A n X n ^ m => X, (n —> co ), 

m—1 

a centered l-dimensional Gaussian vector with covariance matrix E. 


Proof. By the Cramer-Wold device, it is enough to prove that for any vector b £ R ; , 

n 

J2b T A n X n , m ^N(0,b T Zb) . 

m= 1 


Since {b T A n X n,mi 1 ^ rn G n | is a one-dimensional square integrable mar¬ 

tingale difference array, we only need to check the conditions of Proposition 12.11 
Since 


^ Efa T A n X ntm ) 2 |.F n , m _i) = b T A n V n A 


T b 4 6 T Eb 


ra$Cn 


as n —y oo, the first condition of that proposition holds. For the second condition, 
notice that for every e > 0 we can write 


i 

\b A^X. ri,m I ^ d max bj An(^j ) z) I ^ 6 max i 

3 =1 

where 6 is a finite positive constant depending on the vector b and an upper bound 
on the supremum norm of the sequence (A n ). Therefore, 


^ E((b T A n X ntrn ) 2 1 

m^n 


b T A n X nt m 


T, 


n,m —1 


€ 


E 

m<n 


E 


max 


\x„ 


:|W 


i\>e/9 


Fn 


d 

< 02 E E E i X lmA \X„, m ,\>e/e\ E n,rn-l) 0 

i— 1 m^n 

by the second assumption of the present proposition. This verifies the assumptions 
of Proposition l2Tl and, hence, completes the argument. □ 
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3. Asymptotic normality of degree counts 

For k Js 1, let N n (k) be the number of nodes in G n with degree k: 

Nn(k') 'z ^ 1 [D n (v)=k]‘ 
veV n 

Using concentration inequalities and martingale methods it is shown, for instance 
in [111 . Chapter 8 ] that there is a probability mass function {pk,k ^ 1} such that 
almost surely as n —> oo, 

(3.1) fc>l, 

n 

and 

„„ f(2 + S)r{3 + 2S)\ T{k + S) T(k + S) 

(3 - 2) Pk = l T(l + 6) ) T(k + 3 + 26) = C{S) nk + 3 + 2SY 

Of course T(-) is the gamma function. In particular, for k = 1 

2 + 6 

Pi = 


-3-5 


3 + 2(5’ 

and as k —> oo, power law behavior is 

p k ~ c(6)k~ 

We consider the asymptotic normality of N n {k)/n — pk for k ^ 1, and we start 
with the simplest case of k = 1 . 

When proving the asymptotic normality for the number of nodes of degree 1, we 
will use the abbreviations 

V-n = E(N n (l)), 7 = „ fj Wn = “ • 

2+6 n +7 

Note ATi(l) = Vi = 0 since the initial node has a self-loop so D i(l) = 2. 

Let J~ n be the information from observing the growth of the network up through 
the nth stage. Then 

E(N n+ i(l)\T n ) = N n {l) + E({N n+1 { 1) - JV n (l))|.F n ) 

= N n (l) + E(l [ 

n +1 links to vEV n ;D n (v)>l] | Fn) 

= N n (l) + 1 — P[n + 1 links to itself \P n ] 

— P[n + 1 links tore V n ; D n (v) = l|.F n ] 

and using dD and the fact that for the last term D n (v) = 1, we get 

1+6 1+6 


(3.3) 


= JV n (l) + l- 
= JV„(1)(1- 


i (2 + 6 ) + (1 + (5) n (2 + (5) + (1 + (5) 


N n (l) 


7 


n + 7 
= w n N n (l) + w n . 


) + (l- 


n + 7 


We conclude 
(3.4) 


E(N n+1 (l)\E n ) = w n N n (l) +w n , 
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(3.5) 


v n+ l = w n v n + w n . 


We claim 

( 3 . 6) M n+ \ := Nn+ l { P ~ ^ +1 = Nn+l{1) ~„ E " =1 ^ Wj , n> 1 


nu w > 


UU w i 


is a martingale. This is verified using m and (H^- 
Consider now the martingale difference 


d n +i '-—Mn+i — M n — 
1 


^n+l(l) T-ra+l -Wi/l) Vn 


rr= 


3 =1 


n n —1 

i=i 


(3.7) 

As above, 


n;=i«b 

i 

n " n 

,•=i w i 


^-^n+l(l) ^n +1 (^n-^n(l) 

(iV n+ 1 (l) - iV n (l) + JVn(l)(l - W») - ™n) • 


-^n+l(l) -^n(l) • A n _|_i l[n+l links with DGV n ;T n (v)>l ]5 


and A^ +1 = A n +i. Therefore, 
d 2 - 1 

a n+l — 


n -(a„ + i + 2A„ + i(iV„(l)(l - w„) - Wn) 

(n i= i ^) 2 v 

+ (iV n (l)(l - W n ) - W)„) 2 ^ 

TFfn— - ^(A n+ i{l + 2((1 - W n )N n ( 1) - W n )} 

dUl Wj) 2 V 

+ ((1 - W n )N n ( 1) - W'n) 2 ). 


Since 


E{A n +i\T n ) = E(N n+1 { 1 ) - N n (l)\T n ) = 1-f—(1 + iV„(l)), 

n + 7 


we have 

E(d 2 n+ llA) = f'TT n 1 vd * 1 + 2 ((! - ^n)At n (l) - «;„)}(l - -J-(l + N n ( 1 )) 

(n,=i«b) 2 v ^+7 

+ [(1 - Wn)N n ( 1) - W n ] 2 ) 

(38) =: (n5WF x ”- 

We need the following observations: 

(a) w n = n/(n + 7) —> 1, as n —> 00. 

(b) 1 — w n = 7/(71 + 7) ~ 7/ra -»• 0. 
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(c) Owing to the usual recursion on the gamma function, 


T(n + 1 + 7 ) 


r(i + 7) 

(d) Therefore we have 

n n 

n^noi-) 


= n (n + 7-j) = n(7 + 0- 
1=0 1=1 


T(n + 1 ) 


1=1 


(3.9) 


L J[W+ 7 ' T(n +I+ 7 ) 


-r(l+ 7 )n “ 7 = r(—)n “ 7 
Pi 


r(i + 7) 


by Stirling’s formula. 

(e) We have 

,. N n { 1) v n 2 + 5 

Inn - = lim — = p 1 = --. 

n—y 00 n n—y 00 ri 3 + 25 

This allows us to evaluate lim „_>. 00 \ n in (13.81) as 

c 0 (< 5 ) :=(l + 2(7^! - 1)][1 - 7^1]^ + [+pi - l) 2 = 7Pi(l - 7 Pi) 


, N /i + iwHO 

^ 3 ' 10 ^ (3T25) = 

Therefore from (13.91) . (13.81) and (13.101) 


(1 + <5) (2 + 5) 
(3 + 2 5) 2 ' 


(3.11) 

with 


E{d 2 \F n ) 


cq{5) 


(T(l + 7 )n 7 ) 2 


=: ci(5)n 2j 


c 1 (5) = c 0 (5)/(F(l + 1 )) 2 . 


Keeping in mind that the asymptotic equivalence in (13.111) holds a.s, we apply 
Karamata’s theorem on integration (eg, [3, page 25] or [2|), to obtain 


(3.12) ^K(d 2 +1 IJ 7 ) 


1=1 


c iW . 

2 7 + 1 


+ 7+1 — +2 


=: a 2 (5)n 2l+1 , (n—> oo), 


with 

(3.13) 

Now set 


\5) = 


(1 + 5)(2 + 5) 


(3+ 25) 2 (27+l)(r(l+ 7 )) 2 - 

dm. 


dn ’ m ~ a(5)n 7 +!/2’ 


1 ^ to ^ n. 


and the first condition of Proposition 12.11 is satisfied. For the second condition of 
Proposition 12.11 observe that from (13.71) . we get by ignoring constants and remem¬ 
bering |A m +i| ^ 1 , that, asm + oo, 

[|d n ,771+1 | > e] =[|dm+i| > en 7+1/2 ] 

1 


C 


n m 

1=1 w : 


■|[1 + N m ( 1)(1 - w m ) - w m \ > en 7+1/2 ] 
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C [cm 7 1 (const)| > en 7+1//2 ] 
C[cn 7 | (const)] > en y+1 ^ 2 }. 


So with probability converging to 1 as n —> oo, all the indicator functions l[|d„ m |>e] 
vanish, and this verifies the second condition for the central limit theorem. 

We conclude from Proposition 12.11 that m holds. Unpacking (HU, we find 




m =1 


Use (13.91) to get 


—^ cr(5)n 7 + 1 / 2 cr(<5) 

N n ( 1 ) \ _ N n U r 

n n ) yfn 


]_ _ ( N n (l) -v n \ 

^ 7 + 1/2 ) ' 


■ 3=1 


N{0,ai(S)), 


where eri(<5) = r(l + 'y)a(6). Since there exists K such that 


OO 


V Wn ~npi\ < K, 

n= 1 

(eg, [ll|, Section 8.5]), we may conclude the following. 


Proposition 3.1. 

normal, 


The number of nodes at stage n with degree 1 is asymptotically 

~Pi ) =*• -^V(0,o-i(5)), 


where 


(1 + 5 ) (2 + S ) 2 

~ (3 + 2<5) 2 (4 + 3<5) ’ 


4. Normality for the number of nodes of degree k , k > 1. 

The results of Section [3] show how the martingale central limit theorem can be 
used to prove that the deviation of the fraction of the nodes in the nth graph, 
that have degree 1 , from their limiting fraction, is of the order n~ 1 / 2 and, when 
normalized by that quantity, has a limiting centered normal distribution; this is 
the content of Proposition 13.11 It turns out that this distributional result is valid 
for all node degrees simultaneously. More specifically, a central limit theorem in 
R n holds, and the limit is a centered Gaussian process. 

Theorem 4.1. The proportion of nodes with given degrees satisfy the limiting 
distributional relation 

(4.1) n ^ ~ Pk ) ’ k = 2 ’ ‘'') ^ ( Zk ’ k = 1; 2 ■ • ■) 

in R N , where (Zk, k = 1,2 ...) is a centered Gaussian process with covariance 
function Rz given by (14.281) . 
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Remark 4.1. We precede the proof of the theorem with a number of comments. 

First of all, weak convergence in R N is equivalent to convergence of the finite- 
dimensional distributions; see jlj. 

It is clear also that, for every fixed n, the stochastic process in the left hand side 
of (14.11) will have at most n random elements; however, all elements in the limiting 
process are random. 

Finally, the variance of Z\ in the right hand side of m is given in Proposition 

EU 


Proof of Theorem \ f.l\ As in the one-dimensional case of Section El we will use a 
martingale central limit theorem, so we start with constructing a suitable martin¬ 
gale for each fixed node degree. For k = 1,2... we denote v n {k) = E(N n (k)), 
k (so that v n = z/„(l)). It follows from (14.11) and bounded convergence, 

(4.2) 1. 

n 

Let 

(4.3) 
so that 

( 4 - 4 ) “"+i “ V" n {2 + 6) + 

and for future use, note that by the Stirling formula, 

(k) r(n + (1 + 5)/ (2 + 6))T(k + (1 - k)/ (2 + 6)) 


a! fc > = 


n— 1 


n i 


_ i—k 


k + 6 

i{ 2 + <5) + 1 + 6 


A k. 


( fc ) — n ( k ) (1 _ 

■ki _L 1 - Ur, \ -L 


k + S 


TTsY 1 ’ 


(4.5) 


a' ' = 


T(k + (1 + S )/(2 + 5))T(n + (1 - k)/(2 + 8)) 


r(fc + (1 - k)/ (2 + (5)) (k+5)/(2+5) 


(n —> oo) 


T{k + {l + S)/{2 + 6)) 

so that as a function of n, is regularly varying with index (k + S)/(2 + S). Also 
define, 


(4.6) 


- n 


fc-1 . x 
1 + 0 


( i) k ~i ^ 1 < ? < k 

i~k { } (k-j)\r(j + sy ^ J ^ ■ 


We use the usual conventions to set = 1, and set 


(4.7) 


= a ^ bf (. Nn(j) - u n {j)),n>k. 
4=1 


The process (M^) coincides with the process (M n ) defined in (13.61) and we already 
proved the latter process is a martingale with respect to the filtration (_7 r n j. Now 
we check that for each k ^ 1 the process {Mn ' 1 , n ^ k) is a martingale with respect 
to the same filtration. 
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Recall the dynamics of the of the counting processes ( N n (k )): for each fixed n, 
there is a partition of the sample space into disjoint events A n (k), k = 0,1,.. 
n ^ 1, n ^ fc, with 


(4.8) 


P{A n (k) \T n ) 


n( 2 + 6 )+(l+ 5 )Nn(k), k — 1, . . . , 12 

_ 1+6 _ u — n 

n(2+5)+(l+<5) > «--U. 


The event A n (k) is the event that a new node appears at stage n + 1 and attaches 
to v £ V n with D n {v ) = k while A„(0) is the event that a new node attaches to 
itself. In terms of these events, for each k ^ 3 and n ^ k, 


(4.9) 

for k = 2 and n 2 


N n (k) + 1 on A„(fc - 1), 
N n+1 (k) = { N n (k) — 1 on A n (k), 
N n (k) 




N n {2) + 1 oni„(0)UA„(l), 

(4.10) N n+ 1 (2) = { N n (2 ) - 1 on A n ( 2), 

iV n (2) on Uj> 3 ^n(j), 


while for k = 1 and n ^ 1, 

(4.11) JV n+ i(l) = 

In particular, for n ^ fc, 

(4.12) £?(JV n+1 (*:)|J- n ) = ( 1- 


AT„(1) + 1 on Uj^ 2 A„(j) , 
7V„(1) onA„(0)U4„(l). 


k + S 


n(2 + 6) + (1 + 6) 
k - 1 + 5 


N n {k ) 


(4.13) 


E{N n+1 ( 2)\F n )= 1- 


n(2 + <5) + (1 + 6) 

2 + 6 


n(2 + 6) + (1 + 5) 

1 + 6 


N n (k — 1), k^3. 
N n { 2) 


n(2 + 6) + (1 + 6) 


(l + N n (l)). 


Taking the expectation, we see that 


(4.14) 


(4.15) 


v n+ i(k) = 11- 


k + 6 


n( 2 + 6)+ (1 + 8) 
k — 1 + 6 


V n {k) 


u n +1(2) — ( 1 — 
+ 


n(2 + <5) + (1 + 6) 

2 + 6 


n(2 + 6) + (1 + 6) 

1 + 6 


v n (k - 1), k > 3 , 

v n { 2 ) 


n(2 + 6) + (1 + 6) 


(l + v n (1)) • 
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The corresponding dynamics for k = 1 is given in (13.31) . Therefore, for n ^ k, 


E(M^ n ) = k) E[(N n+1 (j) - u n+1 {j))\E n \ 

j=i 




.7—2 


E & . 

1=2 

j - 1 + (5 


1 - 


j + 5 


(-^n(i) - ^n(j)) 


i(2 + <5) + (1 + <5) 


'fc-l r 


423 £ 

l i=i 

- 


fef } ( 1 - 


j + 5 


3+1 n(2 + (5) + (1 + S) 

(k) 


n(2 + <5) + (1 + S) 

( N n (j - 1) - V n {j - 1)) 

3 + S \ 

i(2 + 6) + (l + S)J 

{Nn{j) - Vn{j)) 

(N n (k) - V n (k )) l , 


+ fr K > I 1 _ k + S _ 

k \ n{2 + 5) + {l + 5) 

and elementary calculations show that this is the same as the right hand side of 
(TO) . Therefore for each k. the process (M ^, n ^ fc) is indeed a martingale with 
respect to the filtration (E n )- 

For k = 1,2,... define a /c-variate triangular array of martingale differences by 


(4.16) 




Mg - 


agn 1 ! 2 


m = k + 1 ,..., n, j = 1,..., k , 


for n = k, k + 1,.... In order to use the multivariate martingale central limit 
theorem in Proposition 12.21 we compute the asymptotic form of the quantities 


Em — 1 ) •> 


(4.17) G n,m(J'ij ) •— E(^X n ,m,iXn,m,j \E m—l) 

= (a«a«n)" 1 E((MW - M«_ 1 )(M« ) - ,) 

to = /c + 1,..., n, i, j = 1,..., k. By the martingale property, 

(4.18) 

~ AfW) (M^ +1 - Mg) | E m ) 

= e\j 2 bf { a m+lNm+l ~ agNg) jz b\ j) { a m+lNm+l ~ ag Ng) 


Ld=l 


Xrr 




w r„(o.. „(<*). _ v' /.w) 


i=l 


iA l > 

b m-\-l u 'm-\-l u> m ,y m)/_^ u l \ a m+l u m+l a m u m ) • 
Z=1 
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We begin by analyzing the behaviour of the deterministic term in the right hand 
side of (14.181) . We claim that for every i ^ 1 , 


(4.19) 


1 


lim -p- S^ by ( 


(i) /„(*) ,,( d ) 


,(*) 

d=l 


( 0 , 


®n+l^n+l 


< d) ) = 6 ? 


(i) 


with 6 ^ given by (14.61) . Indeed, for 3, by (14.31) and (14.141) we have 


-aWi/M =a. (< ) 


°n+l t 'n+l a n 


ra +1 1 "ra 


,(<*)_ 


i — d 


i (2 + <5) + 1 + 8 


+ v 


id- 1 ). 


rf-1 + 5 \ 

i (2 + (5) + 1 + S J 


(i - d)p d + (d- 1 + 5)f>d-i m 
2 + 5 ° n 

as n —> oo by m , the fact that a^ +1 ~ and the same is true for d = 2 by 
(14.15|) . Similarly, using (13.31) . we obtain for d = 1 that 


„W J 1 ) 1 T) , | i"\ ff (i) _„(») | „(») 

^n+l^n+l a n ^n \ 2 + 5 ^ ^ 


\2 + 5 

as n —> oo. Therefore (with p 0 = 0), 

1 


1 — 1 

2 + 5 


Pi, 


1™, —(IT H } («i+l4 d +l - »n^n } ) 


n—> oo „(0 - 

d=l 


= ^ * [(* “ d ^ Pd + ( d - 1 + ^)Pd-l] = b' 


« 

1 5 


J =1 


since 


(4.20) 


^]6^ ) [(i-d> d + (d- 1 + 5)p d - 1 ] =^]p d [(j-d)6^ ) + (d + 5)5^ 1 ] =0. 


d=l 


d=l 


Next, by (Oil . (tTTUl) and (1XTTD . 


(i) 

*n+l 


JVn +1 (d) - o«IV n (d) = a^ +1 


N n (d)- 


i + 5 


;(2 + 5) + 1 + 5 


+ B n {d ) 


with 


(4.21) 


Bn(d) 

Bn (2) 
B„(l) 


1 on 4 n (d-l), 

— 1 on A n (d), for d ^ 3, 

0 on Uj^{^_i 5 ^^4 n (_)), 

1 on A„(0)Ud n (l), 

-1 on A n ( 2 ), 

0 on Uj^ 3 A n (j), 

1 On Uf^ 2 ^n(j) , 

0 on A„(0)Ud„(l). 
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Therefore, as n —> oo 
1 


E 


-urnj E b f ( a n+i N n + i (d) - a$N n {d)) E W (a^ +1 N n+1 (l) a&N n {l)) 

% d =l 1=1 


E 


E^Kw 

d=l ' 

J2 b i J) ( N "0) 


i + 5 


i=i 


n{ 2 + 5) + 1 + 5 
j + 5 


+ B n (d) I x 


n(2 + 5) + 1 + 5 


+ B n (l) 


■Bn 


= J2 b d( i + 6 ) 


N n (d) 


d=l 


n{ 2 + 5) + 1 + (5 


Er(i+^) 


JVn(0 


1=1 


n (2 + 5) + 1 + 5 


+ E b d ' ' (* + 5 ) „(2 + /) V 1 + (5 ^ b \ 3)E ( B n(l) \Fn) 


d= 1 


+ E o<+(A.M|A) 

+EE^ )fe P )£ ( B ™( d ) s «(ol^) 

d=l 1=1 

■= Si t n(i,j) + S 2 ,n{i,j) + S 3>n (i,j) + 5 4 ,„(*, j) • 


It follows by (13.11) that 


E 


b^\i + S)- 


N n (d) 

i(2 4- 4- 1 4- 6 


i + 5 


o 1 £ 




tU) 

b d Pd 


a.s. as n 


Next, by (142TL 


E^(Bn(d)|J 7 n)=6i <) 


d=l 


1 - 


1 + 5 


+ li^2 b 2^ 


1 + 5 


n{ 2 + 5) + 1 + 5 


n (2 + 5) + 1 + 5 
(1 + JV„(1))- 


(1 + ^n(l)) 


2 + 5 


+ 1*^3 E b ~ 


d =3 


d — 1 + 5 


i (2 + 5) + 1 + 5 


JV„(d-l)- 


n{ 2 + 5) + 1 + 5 
d + 5 


i (2 + 5) + 1 + 5 


iV„(2) 

iVn(<0 


-+ 6 $° ( 1 - 


^ + ^JTS E ’ (( d - 1 + 5 )^-l - ( d + S )Pd) 


= b l ] + E-T E b d ] (( d - 1 + %<*-! - ( d + S )Pd) 


2 + 5 


d= 1 
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l(») _ i + s 

1 2 + 6 




d=l 


a.s., where at the last step we used (14.201) . We conclude that, with probability 1, 


Si, n {i,j) -+ 


(i + 8){j + 6) ^ , (i) 


(2 + sy 


E+«E+ 


d=l 


Z =1 


* + 5 . +> / , O') 3 + 6 1,0) 




d=l 


2 + <5 


i pi 


i =i 




2 + (5 


j=i 


2 + (5 


d Pd 


d=l 


Finally, we consider the term S^ n {i,j). Note that, by (14.211) . we have the following 
cases. 

(1) On4(0)Ud„(l): 

E(B n (d)B n (l)\F n ) = ld=/=2 • 

(2) On A n (m), m > 2, 

p(R M)R mlp)- / 1 tfd,l€{l,m + l}0Td=l = m, 

* K » nW v n w\j- n) - | _ x \{ d = i {i ; m + 1} or Z = m, d € {1, m + i}. 

Therefore, using the convention b (l ' = 0 if d > i, we can write 

1 + 5 


S+n(i,j) = 


n( 2 + 6) + 1 + 6 


(1 + nW) 


+ ( 5 i l} “ + 6 ™+i) ( 5 i J) “ + 6 m+i); 


m + S 


n( 2 + 5) + 1 + 6 

°° i x 

e -«+e.x"? -«+e.K 




m —1 


a.s. as n —> oo. We conclude that, with probability 1, 


(4.22) 


£ 


77777 C^n+i - MW) (M« - M«) 


(z) (j) 

ah a)i 


F n 


-+ «(bj) 


00 i r 

= : e - 6S+e I )(fS j> -«+e>+ 
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Next we simplify the expression for a(i,j), to make it clear that a(i,i) > 0 for 
alii > 1. Note that, by (13.21) and (14.61) . for each i , 


E^- - cW(-i)T( i+ *)B-i) V, )!r(d+3+M) 

d= 1 d —1 


= ^) ( ~. 1) T (i t/ ) E(- i r 


= <S) 


i + 2 + 25 

(-1 ) T (» + ^) 

i + 2 + 25 


d—l 


1 


1 


_{i- d)\Y{d + 2 + 25) (i — d—l)!T(d + 3 + 25)_ 

1 , 1 


(i — 1)! T(3 + 25) 


+ (-!)’ 


Y(i + 2 + 25) 


where at the last step we used the telescoping property of the sum. Elementary 
algebra now gives us for i ^ 1 , 


(4.23) 


j (i) i + 8 u (i) 2 + 5 (—l ) 1 r(i + 5) 

°1 - / , b d 


2 + 6 7^i d Pd r (! + S ) id ~ f) ! r (* + 2 + 25)' 
Similarly, for i ^ 1, using a telescopic property, 


E 


-(<.?>-«+e.) 2 p m =((.s ,, ) 2j!ffi - ^ r(ra+1+4) 


2 + d 


E 


m=i -\-1 

,(i)s2 C(d) 


2 <5 ^ E(m + 3 + 25) 

m=i+l 


= m 

= (b^ 2 


E 


r(m + 1 + d) r(m + 2 + 5) 


2 + 5 m ^ +i l + 5 [T{m + 2 + 25) r(m + 3 + 2d) 


c{S) 


r(z + 2 + d) 


* (Id - d)(2 + 5) r(i + 3 + 25) 

_ T(3 + 2d) T(i + 2 + d)(r(i + d)) 2 

~ r (2 + d)(r(i + d)) 2 r(i + 3 + 2 d)((* - 1 )!) 2 

Therefore, by (14.23|) . 


*(M) > E 


m + 5 , (i) 


i + 5 


(//*) _ L(i) i M \ 2 _ I + i ) _ 

2 + 5 1 i m m+l) Pm 1 1 2 + d 

m=i-\-l \ d—l 


E^’w 


( 2 + 5 

r (i + s) \ 2 

T(3 + 2d) 

K i - b 2 + 25)r(i + 2 + 5 ) 

Vr(i + <5) (<- 

!)!(* + 2 + 25)J 

(2 + d)T(3 + 5) 

T(z + 2 + 25) 


Note that the expression inside the bracket is greater than 

T(3 + 25) T(i + 2 + 5) 

(2 + d)T(3 + d) T(i + 4 + 25) 

and, since for 0 < b < a, the ratio T(a; + a)/T{x + b) is increasing in x ^ 0 , the 
above difference is, for i ^ 2 , at least 

T(3 + 2d) T(2 + 2 + d) 


(2 + d)T(3 + 5) T(2 + 1 + 25) 


- i =irr i > a - 



































COUNT NORMALITY 


15 


This shows that a(i,i) > 0 for all i ^ 2. The fact that the same is true for i = 1 
can be seen directly from (14.2211 (and was also shown in Section [3|. 

We know from (14.221) that 

nG ntn (i,j) ->• a(i,j), (n ->■ oo), 

and from the definition (14.1711 we have 

a {i) a U) 

yr _ Urn U m (. ^ 

^n,m — r-\ (j) JJ- 

nan an 

and from the regular variation property after m , the function 

u(m) := a$a%rnG mtrn (i,j) 

is regularly varying with index (i + j + 2<5)/(2 + 5). Therefore, from Karamata’s 
theorem on integration of regularly varying functions 


n v / \ 

V n (i,j)= E 

nan 


m—k -\-1 


i(n) 


( l+ 2 +s 8 + 1 ) nan 1 On 


(<)„(*) 


l(b j)~ 


2 + 5 


i + j + 3 5 + 2 


for i,j = 1, ■ • •, k. where a(i,j) is defined in (14.2211 . This verihes the first condition 
the martingale central limit theorem of Proposition [2T2] (with each A n being a k x k 
identity matrix.) The second condition of the theorem holds as well, since by (HU), 
the differences are bounded, 

| ( N n (j ) - v n (j )) - (7V n _i(j) - v n -i(j )) | < 2 for all j , 

hence, as in the one-dimensional case of Section[3l for all n large enough, the events 
{X n , m ,j > e} are empty for all m ^ n and all j. We conclude that 


(4.24) 


E b o k) ( N nU ) - Mi)) , k > 1 1 =» (Y k , k = 1, 2 ...) 


7,1/2 


i=i 


in K n , where (Yk, k = 1,2...) is a centered Gaussian process with covariance 
function Ry given by 


(4.25) 


= - 


2 + 5 


i,3> 1 - 


i + j + 2 + 3<5 

We use this covariance function to define the k x k matrix 

Ry.k = (Ry(i,j), 1 < i,j < k). 

For a fixed k = 1,2... the convergence in (14.241) means that 

'Nn(j)-v n (j) . 


C k 


i 1 / 2 


j — 1 ,..., k ^ => (Yj, j — 1,..., fc) 
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where C k is a k x k matrix with the entries 

c =1 b f 3 ^ i 
1,3 lO j>i. 

Using the easily checkable identity, valid for any real r, 

i 

(4.26) Y, rm ~ jb ^ b j m) = b f0 + r) l ~\ 1 < j < *, 

m—j 

we can check that the inverse of Ck, Dk = C^T 1 , has the entries 

i-j ?,(*) 


(4.27) 


d, J = \ (- 1 ) 1 ° b J 
J I 0 j > i. 


We conclude that 

'N n {j)-v n {j) . 


7 . 1/2 


j = => D k (Yj, j = 1,..., k) T , 


and the covariance matrix of the limiting Gaussian vector is given by 

U*, = DkRy,kD k j 

where Ry,k is the k x k matrix given after (14.2511 . 

In order to facilitate the calculation of the entries of the matrix Y, k , we write the 
matrix Ry, k in the form 

00 /* OO 

Ry,k = V / ( 2 + S) e ~ {2+3S)x Rm,x dx , 

JO 


m —0 


where 


-1 


TO = 0, 


h m~<, ,0^ m=1;2 ,..., 

and the matrix R m ^ x is a k x k matrix of the form 


^ m,x' 

Here C m>1 is a vector with the entries 


Cm.xi'i) - 


(/,(’) _ i±i v* b^v'O 

^°1 2+5 +^d=l °d Pd J 


e 1 to = 0 


(&i ^ - 6 m + 6 ^ +1 )e i ^ 1 to ^ 1 . 


Therefore, 


°° /*oo 

£*, =Y hm ( 2 + S)e- {2+3S)x D k C^ x C mtX Dl dx. 

m =0 

Note that by (14.2711 and (I4.26|) . for any to ^ 1 and * = 1,2,..., 

(D k Cl tX )(i) = £(-l y- jb f\ b [ j) - b %+ b % + i)e- jx 

3 = 1 

= e- x 6^(-l + e- x ) i " 1 -e- mx 6W(- 1 + e- x ) i “ m + e-( m+1 H^ ) +1 (- 1 + e- a: ) <_m “ 1 . 
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7-2 


Therefore, for to ^ 1 and *, j = 1,2,..., 

{D k Cl >x )(i)(D k Cl tX )(j) = b^b[ j) e- 2x (-l + e~ x ) i+j - 

- (f>S‘ , 6£ > + b$b[ 1> )e- lm + 1 >’(-l + e 'C' " 

+ (+&“+, + 0“K‘” +2, '(-l + e-*) ,+l -"- 2 

+ 1 + e-“) <+ ’ _3m 

- (»s»Sn+ os?k <2 ” +i) '(-i+ e -)‘ +j - 2 ”-‘ 

+ + e-“) ,+ ’- 2 "- 2 


Z =1 


We have: 


pOO 

/ (2 + 5)e-( 2+3 ^0W x (i,j)d a; 

Jo 

p oo 

= (-l) i+ J(2+ <5)6^6^ / e- (4+3 ' 5)a: (l-e- a: ) J+J " 2 dx 

Jo 

= {-l) i+j (2 + 5)&f 6 ^ B (4 + 35, * + j - 1) 


Similarly, 


pOO 

/ (2 + <5)e-( 2 + 3 ^^) x (i,j)da ; 
Jo 


= (_l)7+7-m ( 2 + u , ri 

poo 

I (2 + 6)e-V+ 3S H£Ui,j)dx 


T(to + 3 + 35) (i + j — to — 1)! 
f(i+7T3T35) 


=(-D--»-( 2 ++ e+ j >> r(m+ " + . 3 ?l i , + ? ;,r ~ 2)! 


T(i + j + 3 + 35) 


poo 

J' (2 + 6)e-( 2 +W*eW x (i,j)dx 

= (-1)^(2 + i)W r(2m r(?+j +3 + 3.0 2m>! ■ 


(2 + 5). 


,-(2+3<5)x 


e mAhj) dx 


- <-d«( 2 +«(«e,+e.a r(am+ r 3 ( ^^^- 1)! . 
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/•OO 

/ (2 + S)e-^ +3S ^e^ x (i,j)dx 

Jo 


= (-1)^(2 + S)b^ +1 J£ +1 


T(2m + 4 + 3<5)(z + j - 2m - 2)\ 
r(i + j + 3 + 35) 


Similarly, by (14.271) and (14.231) . for i ^ 1, 


(D k c% iX )(i) 


(2 + 5)T(z + 8 ) _1_ 

T(l + S ) ^ J (i — Z)!(Z — l)!r(Z + 2 + 2S) ' 


Therefore, for i,j ^ 1, 

/•OO 

/ (2 + 5)e-( 2+3 ^(D fc C^)(*)(Co,,^)(j)d a; 
Jo 


(2 + 5) 2 T(i + 5)T(j + 5) 

(r(i + <5)) 2 


vU v - (- 1 )' 1 +' 2 (/ i +/ 2 + 2 + 35)- 1 

^ (* - h)Kh - l)!r(/i + 2 + 28)0 - h)Kh ~ l)!r(Z 2 + 2 + 2«5) ' 

We conclude that the covariance function of the limiting Gaussian process (Z&, k = 
1,2...) in ED is given by 

= T(i + j + 3 + 35) 

OO 

(4.28) ^ (m + <S)p m pW } T(4 + 3<5)(* + j - 2)! 

m= 1 

+ (-l) m (6^6£ ) + + 3 + 35) (i + j — m— 1)! 

+ (-1 r(*M + i + €+i^ } )r(m + 4 + 35)(i + j - m - 2)! 

+ b^b^T(2m + 2 + 35)(i + j - 2m)! 

+ (*M+i + b% +1 b£)T(2m + 3 + 35)(i + j -2 m - 1)! 

+ ^m+i^m+ir(2rn + 4 + 35)(* + j — 2m — 2)!] 

_ (2 + 5) 2 T(z + S)T(j + 5) 

(T(l + 5)) 2 

{-l) h+h (h+l 2 + 2 + 35)- 1 

^ ^ (* - hWi - i)!r(Zi + 2 + 2 8)0 - i 2 )Kh - i)!r(z 2 + 2 + 25 )' 

Once again, it is possible to show that Rz(i,i) > 0 for all i :jji 1. We omit the 
argument. □ 
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5. Concluding remarks 


The model considered here is relatively simple and the calculations are relatively 
complex and it remains to be seen which more realistic models allow us to success¬ 
fully conclude asymptotic normality of degree counts. We are particularly anxious 
to extend our methods to directed graphs where each node is indexed by (at least) 
two characteristics such as in and out degree. Some investigations are currently 
underway for the directed preferential attachment model considered in [3, [71, [9|, ]l0(. 
We also have a program evaluating various inferential methods for estimating model 
parameters which requires asymptotic normality results such as presented here. 
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